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I^MARKS/ARGUMENTS 

These remarks are made in response to the Office Action of August 25, 2004 
(Office Action), This response is filed with a petition for a one-month extension of time 
and with an api)ropriaTe fee. 

In the Office Action, the Examiner objected to claim 15 (and 35) under 35 U,S,C, 
§ 1 12. In response, Applicants have amended claims 15 and 35 so that the phrase "two 
consecutive words" now reads "a plurality of segraentable data elements." This 
amendment is supported by page 16 of the disclosure that shows how a tag 
FIRSTNAMEl ASTNAME includes data elements of FIRSTNAME (Pedro) and LAST 
NAME (Rome:^o). Responsive to this amendment, Applicants respectfully request that 
the rejections to 35 U.S.C. § 1 12 rejections to claims 15 (and 35) be withdrawn. 

Additionally, in the Office Action, the Examiner has rejected claims 1-4, 8-9, 12- 
13, 16-25, 28-29, 32-33, and 36 under 35 U.S.C, § 102(b) as being anticipated by U.S. 
Patent No. 5, 797, 123 to Chou, et ah (Chou). Claims 6-7, 10-11, 14-15, 26-27, 30-31, 
and 34-35 have been rejected ur4der 35 U,S.C. § 103(a) as being unpatentable over Chou. 

In response, Applicants have amended claims 1^ 17, 1.8, and 21 to clarify that the 
converting of an utterance into a basic speech unit occurs directly without first converting 
the utterance into a sequence of textually represented words. This clarification is 
supported by pf ge 3. lines 1-6, by page 3, linea 16-20, page 6, lines 5-8, page 2, lines 19- 
22, by FIG. 1, and by material presented throughout the specification. No new matter 
results from these amendmentfi. 

Prior to addressing the rejections on the art, a brief review of the Applicants' 
invention is in jrder. Conventional natural language understanding (NLU) applications 
are two stage applications that first convert a speech utterance into a textual 
representation f nd second compare the textual representation tc> a vocabulary of words. 
Thus, conventional NLU components process textual representations provided at the 
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word level. Tlie Applicants' claimed subject matter, teach a single stage NLU technique, 
where NLU cc mponents receive speech utterances/ as opposed to textual representations, 
and directly generate information required to process a user request (i.e. perform 
programmatic actions responsive to the spoken utterances). A NLU component capable 
of directly responding to speech utterances can operate within a confuting resource 
constrained environment, like within an embedded device or within a personal data 
assistant (VDJi). Further, a NLU that functions directly against non-textual speech 
utterances can operate without modification with input from any language, as lexical 
translations to a language-specific words are unnecessary. 

Turning; to the rejections on the art, claims 1-4, 8-9^ 12-13, 16-25, 28-29, 32-33, 
and 36 have bjen rejected und^r 35 U.S.C* § 102(b) as being anticipated by Chou. Chou 
discloses a method that examines a spoken utterance in a sentence formed from a 
plurality of wcTds for "key phrases" that are task related. Words in a sentence other than 
the key phrases can be discarded. Thus, Chou teaches that conventional grammar 
constraints shC'Uld be relaxed to focus upon a set of recognized key phrases (from column 
3, lines 37-42) 

As usee, in Chou (column 3, lines 27-30) a semencc means any sequence of words, 
regardless of v/hether or not such a sequence of words comprises a grammatically correct 
sentence structure. Further, "key phrases" (from column 3, lines 31-34) means a 
sequence of one or more word$. 

Like other conventional NLU processing components, the components of Chou 
rely on a two step process where speech utterances are received and converted into a 
textual rcpres?ntation. The textual rcpresetitations are processed by Chou on the 
"sentence" level, where "worda" (that are textual representations of a previously spoken 
utterance) are compared agaimt "key phrases" recorded in a grammar. 
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Chou teaches that when constructing the textual representations, words can be 
"built" by segmenting audible input into a series of sub-word units (syllables* phonemes, 
etc). Each sut-unit will correspond to textual representations and lexicon specific rules 
can be used to selectively choose which sub-word combinations should be used to 
construct accurate words. Textual representations of words can be formed by 
concatenating sub-word textual representations to form textual representations of full 
words. These textual representations of words can be processed by inventive 
arrangements disclosed within Chou to determine key phrases within sentences. 
Accordingly^ C hou teaches that within stage 2 (processing textual word representations of 
speech converted utterances) a further processing based upon key phrases can be 
beneficial. 

Referring to claim I, Applicants claim the steps of: 

(a) converting a user utterance directly into a plurality of basic speech units 
without converting the utterance Into a sequence of textuallv reoresepted words , 

said user utterance being a sequence of words expressing a query or a command; 

(b) matching said plurality of basic speech units against a plurality of 
combinations cf items, wherein each item is tagged data or is- a concept code; and 

(c) generating a combination of items likely to be representative of said user 
utterance. 

As previously noted, Chou teaches that speech utterances are to be converted into 
textual representations grouped as sentences, each sentence comprising a plurality of 
tcxtually rcprc;icntcd words (column 3, lines 27-30). Chou requires utters be converted 
into textual ^^ord5 before key phrases are identified and further processing occurs 
(column 4, lin<js 33-35, supported by column 5, lines 52-56 that states a subword based 
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speech recognizer can be adapted to recognize a set of key phrases (still converting 
utterances into textual words representations by definition - textual representations that 
can be matcheii by the adapted subword speech recognizer to the key phrases). 

This characterization of Chou is further evident by FIG. 1, where a separate 
component for subword models 22 conveys information to the key-phrase detector 1 1 
(i.e. textual representations are sent from the subword model 22 to the key-phrase 
detector 1 1 that searches for textual strings ("key phrases") built upon received textually 
rendered inpui: from the subword models 22.) That is, the key-phrase detector U can 
combine outfiut from the subword models 22 into sequences of textual word 
representation.'! that the key-phrase detector 1 1 recognizes. 

Accordingly, Chou does not teach or inherently contain the claimed limitation of 
converting a user utterance ^irectlv into a plurality of basic speech units without 
converting the utterance into, a sequence of textuallv represented words . Since under 

35 U.S.C. § l(»2(b) each claimed limitation must be taught or inherently contained within 
cited art, the rejection to claim 1 should be withdrawn, which action is respectfully 
requested. 

Further, claims 17, 18, and 21 also include the limitation of directly converting a 
user utterance into basic speech units without converting the utterance into a sequence of 
textually represented words. Claims 2-4, 8-9,. 12-13, 16, 19-20, 22-25, 28-29, 32-33, and 

36 are depencent upon claims I, 17, 18, and 21. Accordingly, the 35 U.S.C. § 102(b) 
rejections to claims 1-4, 8-9, 12-13, 16-25, 28-29, 32-33, and 36 should be withdrawn, 
which action is respectfUUy requested. 

Proceeding to other rejections in the Office Action, claims 6-7, 10-11, 14-15, 26- 
27, 30-31, and 34-35 have been rejected under 35 U.S.C § 103(a) as being unpatentable 
over Chou. As noted above, Chou teaches away from the Applicants' claimed limitation. 
That is, Chou teaches that utterances ARE to be converted into a sequence of textually 
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represented wcirds, md the Applicants claim the opposite. It should be appreciated that ■ 
Chou performs, the first stage of NLU processing according to conventional teachings, as 
disclosed in th<5 Applicants' background. 

It woulU not be obvious for one of ordinary skill in the art to use the Applicants* 
disclosed technique of directly converting an utterance into units without converting the 
utterance into a sequence of textually represented words bas-ed upon the teachings of 
Chou, which tjaches a post- processing methodology that is used AFTER utterances are 
converted inic a series of textually represented words. Consequently, Chou does not 
teach or suggest the Applicants claimed invention. Accordingly, the 35 U*S,C § 103(a) 
rejections to claims 6-7. 10-11. 14-15, 26-27, 30-3U and 34-35 should be withdrawn, 
which action i:5 respectfully requested, 

AppliCf nts believe that this application is now in full condition for allowance, 
which action is rcspcctiully requested. Applicants request that the Exanuner call the 
undersigned if clarification is needed on any matter within this Amendment, or if the 
Examiner believes a telephone interview would expedite the prosecution of the subject 
application to completion. 



Respectfully submitt£4, 





Gregory A. Nelson* Registration No, 30,577 
Richard A. Hinson, B.egistration No. 47,652 
Brian K. Buchbeit, Registration No, 52,667 
AKERMAN SENTEKFITT 
Customer No. 40987 
Post Office Box 3188 
West Palm Beach, H. 33402-3 188 
Telephone: (561) 653-5000 
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